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Abstract — The complexity of round robin iterative data 
flow analysis has been traditionally defined as 1 + d 
where d is the depth of a control flow graph. However, 
this bound is restricted to bit vector frameworks, which 
by definition, are separable. For non-separable frame- 
works, the complexity of analysis is influenced by the 
interdependences of program entities, hence the bound 
of 1 + d is not applicable. This motivates the need for 
capturing the interdependences of entities to define a 
general complexity measure. 

We propose Degree of dependence 8 which quanti- 
fies the effect of non-separability on the complexity of 
analysis for a particular problem instance. We define 
the complexity bound of 1 + 8 + d which explains the 
complexity of round robin analysis of general non- 
separable data flow problems. Like d, 8 is a theoretical 
concept useful for understanding the complexity rather 
than estimating it. In bit vector frameworks the bound 
1 + 8 + d reduces to 1 +d due to 8 = 0. Apart from being 
general, our bound is also precise, as corroborated by 
empirical results. 

Index Terms — Data flow analysis, Complexity, Con- 
stant propagation 

I. Introduction 

For a given instance of a data flow framework, the 
complexity of performing data flow analysis depends on 
a combination of the properties of program structure 
(in particular, loops and their complex arrangements), 
separability and boundedness of flow functions, height 
of the lattice, and the order of traversal over the control 
flow graph. Among the general complexity measures for 
data flow analysis ([1], [3], [7], [12], [13], [17], [19], 
[22]) the number of iterations required for convergence 
of round robin iterative method has been explored in 
details. The traditional complexity bound of round robin 
data flow analysis for bit vector frameworks has been 
defined as 1 + d iterations ([1], [7], [12]), where d is 
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the depth of the control flow graph which is defined as 
the maximum number of back edges in any acyclic path 1 . 

Depth d is not independent of the problem instance 
since it captures the structure of control flow. However, 
it does not depend on the program statements. Since 
bit-vector problems are separable 2-bounded problems, 
program statements do not play a significant role in 
increasing the complexity. Hence their complexity can 
be reasonably explained using only the depth d. In non- 
separable frameworks like constant propagation ([1], [7], 
[18]), faint variables analysis ([8], [15]), type inferencing 
of flow sensitive types [11], and points-to analysis ([4], 
[10]) the complexity of analysis is influenced by the 
interdependences among program entities, which in turn 
are heavily influenced by the placement of program 
statements, apart from depth. Complexity of analyzing 
such problems has been observed to be proportional to 
number of program entities, but no strict bound has been 
reported. 

We define an Entity Dependence Graph to capture the 
non-separable information flows in the given program 
instance. We propose 8, the degree of dependence as a 
measure of effect of non-separability on complexity. We 
use 8 to define a complexity bound of 1 + 8 + d which 
is the first ever realistic explanation of the complexity of 
data flow analysis for non-separable frameworks. Note 
that like d, 8 is a purely theoretical concept which is used 
for explanation rather than for estimation of complexity. 
Our complexity bound is uniformly applicable to a 
large class of data flow frameworks. In particular, the 
separability of bit vector frameworks can be viewed as 
a special case of non-separability, due to which 8 = 
and the bound 1 + 8 + d reduces to 1 + d. Apart from 
being general, our measure is precise. Our empirical 
measurements for constant propagation and faint vari- 
ables analysis corroborate this. 

The rest of the paper is organized as follows: Sec- 
tion |H] reviews the theory of data flow analysis. Sec- 
tion |lll] introduces the entity dependence graph. Sec- 

'For bidirectional frameworks, this has been generalized to 1 + w 
([3], [12], [13]) where w is the maximum width of any information 
flow path. 



tion IIVI defines the degree of dependence and derives 
the bound of 1 + 8 + cf. Section [V] provides empirical 
measurements. 

II. Data Flow Frameworks 

A data flow framework D is a tuple (L,\1,F) ([1], [6], 
[7], [9], [12], [14], [18]), where L is a lattice with meet 
l~l and F is the set of flow functions L^L. L is the set 
of data flow values with the partial order C induced by 
l~l and has a top element T and a bottom element _L. 
An instance Id, of a data flow framework D, is a pair 
(G,M) where G is a control flow graph (CFG) and M 
is a mapping from the nodes/edges of the CFG to the 
functions in F. Note that D defines only the structure of 
L, it is actually instantiated by Id depending upon the 
actual number of entities (eg. variables or expressions 
etc.). 



(T) 
undef 

/ / I \ \ 

50 10 1 • • • OO 

\ \ I / X 

nonconst 

(1) 

Component Lattice L 



f + 


f 


C2 


1 


T 


f 


f 


1 


Cl 


f 


Cl + c 2 


1 


i 


i 


± 


1 



(b) Flow function for + 



n 


f 


C2 


1 


f 


f 


C2 


1 


Cl 


Cl 


Cl = c 2=>Ci 

ci ^ c 2 =>i 


1 


i 


_L 


± 


1 



Fig. 1. 



(c) Confluence operation 
Constant propagation framework. 



A. Properties of Lattices 

For a lattice L, a strictly descending chain is a se- 
quence vo □ v\ □ . . . □ v„ such that v,- G L. Analogously, 
a strictly ascending chain is v„ C v„_i C . . . □ vq. The 
lattices in which all strictly descending chains are finite 
have been called bounded in ([9], [14]) and finite in [7]. 
The lattices in which both strictly descending as well as 
strictly ascending chains are finite have been called^zn/fe 
in ([12], [13]) and complete in ([21]). In the rest of this 
paper, by bounded lattices we mean complete lattices. 

The overall lattice L is a product of component lattices 
L \ x L.2 x ••• x L^, where L, is a lattice of the values 
of data flow properties of an individual entity a,-, and 
£, is the number of entities. The T,l eL are tuples 
(T 1 ,T 2 ,...,T^) and (_Li,_L 2 >---,-L|)- In the case of 
available expressions analysis, for a given expression e, 
T is {e} and _L is 0. n and C are fl and C respectively. 
In the case of reaching definitions analysis, for a given 
definition di : x = e, T is and _L is {di}. Further, n 
and C are U and D respectively. 

The height of a lattice is the maximum number of C 
or □ in any strict chain. Let //, denote height of L,-. 
Often all L, are same and the height H of the overall 
lattice L is: 

H = Hjxt, (1) 

B. Properties of Flow Functions 

Flow functions are monotonic ([1], [6], [7], [9], [12], 
[14], [18]) if: 

V/ e F, Wx,y £ L : x C y => f(x) E f(y) (2) 

If the flow functions / : L L in F are tuples of func- 
tions / = (h i,/z2j ■ ■ ■ ,hk) such that h, : Li i— > L/, then the 
framework is separable ([12], [13]) in that the functions 
on one component lattice operates independently of 
others. 



The flow functions are k-bounded if their loop closures 
([12], [13], [17]) (also called/asfness closures in [6], [7], 
[21]) are bounded by constant k. Let —fop and f° 
be the identity function. / ; represents the flow function 
corresponding to the path containing j traversals over a 
loop. £-boundedness requires that 

3k>ls.t.Vf£F: f /■ f ... fl/' (3) 

i=0 

fc-boundedness implies that though a program contains 
infinite paths in the presence of loops, a finite number 
of paths containing upto k — 1 unfoldings of loops are 
sufficient for convergence of data flow analysis. 

The computation of f°r\f l \lf 2 r\ ■ ■ ■ Fif represents the 
greatest lower bound (gib) of the results of first i 
applications of /. Thus it follows a descending chain in 
L. Hence bounded lattices imply bounded loop closures. 

For separable frameworks, values in L change simul- 
taneously due to independence of the component lattices. 
It is easy to verify that for a ^-bounded framework, 

, < ( H + 1 if the framework is separable 
— [ H + 1 if the framework is non-separable 

(4) 

Bit vector frameworks have separable flow functions 
which operate on L with H = 1 . Further, since all flow 
functions in bit vector frameworks are either constant 
functions or identity, f 2 (x) —f(x) implying that k is 2 
for bit vector frameworks. 

C. Examples of Non-Separable Frameworks 

In this section, we briefly introduce constant prop- 
agation and faint variables analysis which are used as 
running examples in the paper. 

Constant propagation [1], [7], [18] identifies vari- 
ables which hold a fixed constant value and replaces 
them by this value. Lattice L of data flow values 



• Lhs n : Singleton set containing LHS variable of 
statement n. 

• Rhs n : Set of variables used in RHS expression of 
statement n which is an assignment statement. 
For non-assignment statements like print and for 
function calls, this set is empty. 

• Use n : Set of variables used in statement n, 
where n is either a non-assignment statement or 
a function call. 

• FaintGen n / FaintKillff : Set of variables whose 
faintness is generated/killed by statement n. 

• Faintln n / FaintOut n : Faintness information at 
entry/exit of node n. 

FaintGen n = {x \ x £ Lhs n ,x f{ Rhs n } 
FaintKilln = {x | .re Rhs n ,y £ Lhs n ,y ft FaintOut n } 
U {x | x £ Use n } 
Faintln n = (FaintOut n — FaintKill n ) U FaintGen n 

(T n is exit node 

P| Faintln s otherwise 
sZzSucc(n) 

Fig. 2. Data flow equations to determine faint variables 



Constant Propagation 

w is 2 at the exit of 4. 

No variable is constant 

at any other point. 
- H = 2, H = 8, w = 3 

Predicted bound : 25 iterations 

Actual iterations : 9 

If the assignments in nodes 5 

and 7 are exchanged, H, H, 

and d (hence predicted bound) 

remain same. Actual number of 

iterations reduces to 5. 



Faint Variables Analysis 

Due to the print statement, no 
variable is faint at any point. 

- H = 1,H = 4, w = 3 

Predicted bound : 13 iterations 
Actual iterations : 7 j 
If the assignments in nodes 5 and 7 
are exchanged, H, H, and d 
(hence predicted bound) remain same 
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Actual number of iterations reduces to 5. 
Fig. 3. Complexity bounds for non-separable analyses are very loose. 



for a variable is shown in Figure [Da), undef de- 
notes an undefined value and forms the T whereas 
nonconst denotes a non-constant value and forms the 
_L. All constants c,, — °° < c, < °° are ordered such that 
c, C T and _L C c,-. Observe that this lattice has an in- 
finite number of elements but a finite height: H = 2 
and H = 2 x Confluence operation for constant prop- 
agation is shown in Figure He)- Flow functions which 
influence the data flow properties of variable correspond 
to statements such as x = y, x = y + z, x — 3 or x — 
read(). For the first two statements, the flow functions 
compute the data flow value of x from the data flow 
values of the variables appearing in the right hand side. 
For the last two, the data flow values of x are C3 
and nonconst respectively. Figure ^b) shows the flow 
function corresponding to a statement x = y + z- Observe 
that it is of the form (L x x L y ) 1— > L z rather than L 1— > L. 
Due to non-separability, its loop closure bound k is H+ 1 
and is not constant. 

Faint variables analysis [8], [15] is a more general 
variant of live variables analysis in that it computes the 
transitive closure of dead variables. A variable is faint if 
it is dead or if it is only used to compute new values of 
faint variables. Variables which are not faint are strongly 
live [20]. 

Data flow equations to identify faint variables are 
given in Fig. [2] For a variable v, T is {v}, _L is 
and H = 1. Though these data flow properties can be 
represented using bit vectors, the flow functions are non- 
separable. Hence this is not a bit vector framework. 



D. Simplistic Generalization of Complexity Bound 

Using only the properties of framework, a simplistic 
generalization of the complexity bound can be formu- 
lated as follows: In round robin method, in the worst 
case, each iteration may change exactly one data flow 
value. For general frameworks, at most H new values 
may be generated at any program point. Since com- 
putation of each new value may require at most d 
additional iterations, the total number of iterations is 
1 + (d x H). In a fc-bounded framework, at most (k—1) 
new values can be generated at a program point. Hence 
(k — 1) can replace H giving a possibly stricter bound 
of 1 + d x (k — 1) (see Equation |4jl. This has already 
been conjectured in [12]. However, since k may not be 
known, we continue to use 1 + (d xH). For separable 
frameworks, it is 1 + (d x H). 

Example 1: Consider constant propagation and faint 
variables analysis for the CFG in Figure [3] Observe that 
the actual number of iterations is much smaller than the 
predicted bound. Besides, a rearrangement of statements 
causes a significant variation in the number of iterations 
and the predicted bound is insensitive to this change. 

Separability guarantees that the data flow properties 
of all entities are independent of each other whereas 
handling non-separability requires the assumption that 
in the worst case, the data flow property of every entity 
depends on the data flow properties of all entities. We 
view these two extreme cases of dependence as two 
extremes of the same continuous spectrum by modeling 
the exact cause of non-separability explicitly and by 



defining the degree of dependence as a measure of non- 
separability. The degree of dependence is defined for a 
particular problem instance using the Entity Dependence 
Graph which we introduce in the next section. 

III. Entity Dependence Graph 

Let the flow function corresponding to a statement s 
be /. Let dfpmodf denote the set of entities which occur 
in s and whose data flow properties are computed by /. 
Note that for the flow of information, this computation 
should potentially compute a non-T value because in- 
formation flow consists of propagating non-T value. Let 
dfpusef denote the set of entities which occur in s and 
whose data flow properties are used by /. 

Formally, an entity dependence graph (EDG) is a 
directed graph G e d g — (N e d g ,E e d g ). N e d g is the set of 
entities defined as follows: 

Nedg = {oc s | oc G dfpmodf, f is a flow function for 
statement s} 

a is the name of an entity whereas oc s represents its 
instantiation for statement s. a v s denotes entity a s with 
value v. Different instances of the same entity may have 
to be created in the following situations: 

1) When a data flow problem requires such a dis- 
tinction. For example, reaching definitions analysis 
requires enumeration of all definitions of each 
variable reaching a program point. 

2) When different instances of the same entity reach- 
ing a program point have different dependence 
relations and hence different influences on the 
complexity. 

E ec i g is the set of edges between entities. An edge 
a,- — > By indicates that the data flow property of entity Bj 
directly depends on the data flow property of a,, where 
i and j may not be adjacent in CFG. aj — * B" indicates 
that value u of B ; - is due to the direct influence of value 
v of a,. Constructing a dependence a, — > B ; - requires: 

• identifying the presence of a flow function / at a 
program point p associated with statement j such 
that Bj e dfpmodf and a e dfpusef, and 

• discovering the instance a, reaching s for 
a e dfpusef. 

For information propagation, each EDG must contain 
some entry node i.e. a node with no predecessors: Some 
entity must change from T to non-T independently, 
otherwise all entities would remain T. In the latter 
situation, data flow analysis need not be performed. 
For separable frameworks EDG-edges can exist only 
between instances of the same entity because data flow 
properties of an entity change independently of other 
entities. 

Example 2: Some EDGs for the CFG in Figure [3] are 
as described below: 

Available Expressions Analysis: In a bit vector frame- 
work, flow functions are either constant functions or 




(a) EDG for Constant (b) EDG for Faint 
propagation problem variables problem 

Fig. 4. Entity Dependence Graphs for the CFG in Figure [3] Edges 
are labeled with edge-weights. 



identity and all entities are independent of others. Hence 
EDG has no edges. For available expressions analysis, an 
entity oc v represents the fact that expression a is killed in 
statement s. For the instance in Figure[5] the set of EDG- 
nodes is {(y-r-2) 6 ,(z + 3)7,(w-l)i,(w- l) g ,(je+l) 5 } 
and the set of edges is 0. 

Constant Propagation: For constant propagation, an 
entity is identified by a variable name and statement 
number associated with a definition of that variable. 
The resulting EDG is shown in Fig. EJa). Nodes wg 
and w\ correspond to definitions of w in statements 8 
and 1 respectively. A reaching definitions analysis using 
the renamed definitions can identify the EDG-edges. 
An edge label represents weight of an edge (defined in 
Section HVt. 

Faint Variables Analysis: Figure 0Jb) shows the EDG. 
The uses of x in statements 2 and 8 are treated as 
different entities denoted by X2 and xg, respectively. 
An assignment s : a = b adds dependence edges a, — > b s 
such that a,- is live at exit of s. Identifying dependence 
edges requires live variables analysis with renamed uses. 

An EDG-path a, -t> B 7 represents a transitive influence 
of a,- on B ; . Cycles in EDG represent self-dependences. 
Interestingly, a cyclic EDG-path may differ from a cyclic 
CFG-path. For example, in the EDG for constant prop- 
agation in Figure |4j a), path (zj ,ye,X5,w&, wj) captures 
the cyclic dependence among entities zi, ye, X5, w$, 
and zi- The CFG in Figure contains many cycles 
not necessarily coinciding with this EDG-cycle. Further, 
each edge in the EDG-cycle corresponds to a cyclic path 
involving the back edge 8 — > 5 in the CFG. 

Observe that the dependence captured by EDG is 
different from the dependences explored in past. Control 
dependence[5] captures the dependence of execution of 
a statement on other statements. Data dependences [16], 
Def-Use and Use-Def chains[l], SSA edges[2] etc. cap- 
ture the dependence of a statement on the data computed 
in other statements. These traditional dependences do not 
capture the dependence of data flow properties and hence 
cannot be used for complexity analysis. 



IV. Defining Complexity Using The Degree of 
Dependence 

For an EDG edge a, — > P ; , edge-weight denoted as 
Wt((Xi — > pj) is defined as the maximum number of 
back edges in any acyclic path from stmt i to stmt j for 
forward problems, and from stmt j to stmt ; for backward 
problems 2 . 

Example 3: All edges in the EDG shown in Figure 
are labeled with their weights. In the EDG for faint 
variables analysis, edge X2 — > ys has weight 3 since the 
path from stmt 5 to stmt 2 contains three back-edges. 
Edge x% — > ys has depth because of the back-edge-free 
path from stmt 5 to stmt 8. 

The weight of EDG-psth 0C,->Py denoted 
Wt((Xj ->Py), is defined as the sum of weights of 
the edges along the path. It is defined only for an 
acyclic path with the relaxation that OCj and Py may 
be same. The degree of dependence of an EDG-path 
a, ->Py, denoted A(a, -t>P;) is defined according to the 
structure of the path as follows: 

• If the path ot;-t>p 7 - is acyclic, A(oc,- -f>P ; -) = 
W(fH 

• If the path a, is acyclic except that py is same 
as a,, then a change in a, has to be propagated to 
all entities along the path, including a,. Doing so 
once may require Wt (a,- -E>P;) additional iterations. 
This may change a, further. The maximum number 
of changes is H, and 

A(0C; ->py) = H X Wt(a t -E>P;) (5) 

In case of overlapping cycles, the maximum value 
of A(a, -t> a,-) is considered. 

• If the path a, -i>p 7 contains m non-overlapping cy- 
cles connected by m — 1 acyclic segments, it has the 
following structure: 

a,- h>Yi -E>yi -t> Y2 -e> Y2 -> • "Yffl ~>y m -> Py (6) 
Then A (a,- ->P/) is defined as: 

m m—l 
A(oc,- ->Py) = £ A (Y« Y») + E A (Y« -> Y»+i) 

n=l n = 1 

+A(a,--t>Yi)+A(Ym-E>P>) (7) 

A (a,- -E> p ; ) computes the maximum number of itera- 
tions required to propagate the influence of a change in 
the value of the data flow property of a, on the value 
of the data flow property of P ; along the path. Let oco 
denote an entry node of the EDG for an instance Iq for 
a given D. The degree of dependence of I& is denoted 
by 8 and is defined as follows: 

8 = max(A(ao -> P;)), for any oco and P ; - (8) 

2 for bidirectional problems, the notion of width [13] must be used. 



In the case of practical non-separable problems like 
constant propagation and faint variables analysis, 8 can 
be made more precise. We define the entity dependence 
in a non-separable framework to be monotonic if 

VocJ -> P}\ ht{u) > ht(v) (9) 

ht(v) denotes the height of v in the component lattice 
and is defined as the length of a longest descending 
chain from f to v. For Y = f(X), let X,Y 6 L be 
{Xi,X 2 ,...,X$ and (Fi,F 2} ...,F|). Let X t determine 
the value of Yj. Then the possible values of Yj are limited 
to those at same height as X{ or with higher height 
(effectively lower in lattice). 

Example 4: For constant propagation, let c denote 
any constant. Then the longest strictly descending chain 
in L is TZIcID-L with heights of elements (0,1,2) 
respectively. For some OcV — > P", condition (|9} can only 

be violated if (v,u) are (c,T), (~L,c) or (_L,T). However 
this is not possible implying that constant propagation 
has monotonic entity dependence. For faint variables 
analysis, the corresponding chain is T □ _L. It can be 
easily verified that the entity dependence is monotonic. 

Theorem 1:. For a data flow framework with mono- 

m 

tonic entity dependence, the term VA(y, -t>Y«) m 

Equation reduces to H x max(l/W(Y„ -E>Y«))- 
Proof : In path structure (|6j, the multiplication factor H 
in A(y„ -t>Yn) (Equation |5} is required only if the flow 
is Y, 1 , -^l" sucn that ht(u) >ht(v). Due to monotonic 
entity dependence, the values computed in each cycle 
will get progressively limited and instead of H changes 
in each cycle, the changes will get distributed over m 
cycles. Let the number of changes in cycle y n ->y n be 
c„. Then we need to maximize the term 

m m 
Y^ c n X Wt(y„ ~>Y«) SUCri mat Y c n=H 

The constraint on c„ defines a hyperplane and a max- 
imum occurs at some extreme point of the plane. 
Hence the maximum value of the summation is H x 
max(l/W(Y„ ->y n )). □ 
Theorem 2:. A round robin data flow analysis of an 
instance fo of a data flow framework D would converge 
in at most 1 + 8 + d iterations. 

Proof : The first change in every ao must happen in the 
very first iteration because it does not depend on any 
other entity. From Equation l|8}, the maximum number 
of iterations required for computing the final value of the 
data flow property of any entity a, is 1 + 8. However, it is 
computed at statement i and also needs to be propagated 
to those statements in the CFG for which this entity 
may not occur in dfpuse/. This may requires at most d 
additional iterations taking the bound to 1 + 8 + d. □ 



Deviation of simplistic bound from actual iterations. 
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(a) Summary of complexity computation for various bench- 
marks: #F is the number of functions, #V is the average 
number of variables. All other values are average values: B\ 
is the predicted number of iterations using 1 + d x H, B2 is the 
predicted number of iterations using 1 + 8 + d, I is the actual 
number of iterations. 
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Fig. 5. Empirical measurements 



Corollary 1:. For bit vector frameworks, the bound 
in Theorem Q reduces to 1 + d. 

Proof : For bit vector frameworks, EDG does not contain 
any edges. Hence 8 = 0. □ 

Example 5: In the EDG for constant propagation in 
Figure |4j a), Oo = d\. Since H = 2, A(oco-E>P;) for 
each fj ; in {wi,ZT,y6,X5,w%} is {0,6,6,6,6} respectively. 
Hence 8 = 6. Depth of the CFG is d = 3. Thus complex- 
ity bound (l+8 + d) is 10. This compares well with the 
actual number of iterations which is 9. 

In the EDG for faint variables analysis in Figure |4jb), 
ao=fl2- Since H = 1, A(ao-r>Py) for each p 7 in 
{x2,y5,Z6,Wj,x%} is {0,6,6,6,6} respectively. Hence 8 = 
6. Using d = 3, we get the overall complexity bound 10. 
The actual number of iterations is 7. 

V. Empirical Measurements 

We have implemented the complexity computation 
and round robin data flow analysis for constant propa- 
gation as well as faint variables analysis to measure the 
precision of the bound 1 + 8 + d for practical programs. 
Our prototype implementation uses XSB Prolog 3 . The 

3 Available from http : / / xsb . source forge . net 



input is obtained by translating gimple IR produced by 
gcc version 4.0.0. 

We have tested our implementation on SPEC-2000 
C benchmark programs, whetstone and digital signal 
processing benchmarks such as gsm. Since the imple- 
mentation is restricted to intraprocedural level, we make 
conservative assumptions for values of global variables 
used in expressions. Since our goal is not to use the 
information for transformation but to measure the pre- 
cision of our bound, aliases are ignored. We have also 
computed the simplistic bound (Section [5} 1 + d x H 
where // = 2x |V| for constant propagation. 

Figure |5ja) summarizes the results for constant propa- 
gation presenting averages of various numbers. The table 
excludes the information of acyclic programs since for 
acyclic programs, the bounds are trivially 1 . Clearly, our 
bound is very close to the actual number of iterations. 
Figures |3b) and (c) plot the number of functions against 
the deviations of the bounds from the actual number of 
iterations. Clearly, the bound 1 + 8 + d has very small 
deviation (0 to 2) for a large number (87.11%) of 
programs. The bound 1 + d x H has large deviations for 
most of the programs. In particular, small deviation (0 to 
2) is found in only 1.28% cases. The deviations of both 



bounds is for acyclic program and we have excluded 
them. 

VI. Conclusions and Future Work 

Non-separability of data flow framework is a dominant 
factor influencing the complexity of round robin data 
flow analysis. The existing theory accounts for lattice 
height and loop closure bounds in determining com- 
plexity bounds. However, it fails to capture the exact 
role played by non-separability. This paper proposes the 
concept of degree of dependence which is a more precise 
measure since (a) It uses the height of the component 
lattice instead of overall lattice while considering the 
cumulative effect only for interdependent entities, and 
(b) It distinguishes the cyclic CFG paths from the 
cyclic dependences. Apart from precision, these distinc- 
tions facilitate generality by placing separable and non- 
separable frameworks on the same continuous spectrum. 
This provides a uniform explanation of the phenomena 
observed in a large class of practical instances of a 
variety of data flow frameworks. 

We would like to extend this work to point-to anal- 
ysis [4], [10] of C and C++ where new entity depen- 
dences are discovered during analysis due to pointer 
indirections. Yet another interesting direction of future 
work is to explore the use of entity dependence graph 
for performing data flow analysis. 
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